Six approaches to limited domain concatenative speech synthesis

نویسندگان

  • Robert J. Utama
  • Ann K. Syrdal
  • Alistair Conkie
چکیده

This paper (based on an MS Thesis by Robert Utama in the Electrical and Computer Engineering department at Rutgers University) describes 6 limited-domain Text-to-Speech (TTS) systems that are constrained to the digit string and natural number domains (cardinal numbers only). Each of the 6 unit selection-based concatenative TTS systems were implemented in MATLAB. We evaluate and discuss various factors that influenced the naturalness or overall quality of the synthesized voice. Some of the factors studied were the length and type of the synthesis unit and the extent of co-articulation represented in the recorded speech database. We show that it is possible to create a high quality limited domain TTS system either with maximal or with carefully controlled minimal effects of co-articulation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Introduction to multilingual corpus-based concatenative speech synthesis

This tutorial paper addresses foreign-language support in corpus-based concatenative text-to-speech systems. We give an overview of application domains where strictly monolingual speech synthesis is not sufficient and where multilingual text-to-speech is required or highly desirable. We describe two approaches to multilingual corpus-based speech synthesis: phoneme mapping on the one hand, and t...

متن کامل

Spectral smoothing for concatenative speech synthesis

This paper addresses the topic of performing e ective concatenative speech synthesis with a limited database by proposing methods to smooth the transitions between speech segments. The objective is to produce naturalsounding speech via segment concatenation when formants and other spectral features do not align properly. We propose several methods for adjusting the spectra between waveform segm...

متن کامل

مراحل و نحوه ی تهیه ی دادگان های صوتی هجایی و دایفونی برای سامانه ی تبدیل متن به گفتار فارسی

Abstract Speech databases are part of the concatenative text to speech synthesis systems. Phonetic quality of the databases plays a significant role in the naturalness of the synthesized speech. This paper introduces two syllable and diphone speech databases for Persian and investigates the way of their development and their specifications and their advantages to each other. ...

متن کامل

Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques

One of the interesting topics on multimedia domain is concerned with empowering computer in order to speech production. Speech synthesis is granting human abilities to the computer for speech production. Data-based approach and process-based approach are the two main approaches on speech synthesis. Each approach has its varied challenges. Unit-selection speech synthesis and statistical parametr...

متن کامل

Limitations to concatenative speech synthesis

This paper discusses techniques for determining the linguistic needs for open-domain synthesis by concatenative methods, and reports on the design and evaluation of a tool for collecting and balancing a speech corpus automatically, in order to ensure optimal coverage of the sounds required for synthesis within a given task-domain. Syntheticallygenerated utterances are used to prompt speakers, a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006